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Abstract 

We describe two quantum algorithms to approximate the mean value of a 
black-box function. The first algorithm is novel and asymptotically optimal while 
the second is a variation on an earlier algorithm due to Aharonov. Both algorithms 
have their own strengths and caveats and may be relevant in different contexts. 
We then propose a new algorithm for approximating the median of a set of points 
over an arbitrary distance function. 
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1 Introduction 

Let F : {0, . . . , JV — 1} — > [0, 1] be a function and m = jj YaLi F ( i ) be its mean - 
When F is given as a black box (i.e. an oracle), the complexity of computing the mean 
can be measured by counting the number of queries made to this black box. The first 
quantum algorithm to approximate the mean was given by Grover, whose output of 
the estimate rh was such that \m — rh\ ^ e after loglog(j)) queries to the black 
box Q. Later, Nayak and Wu [10] have proven that to get such a precision, 0(l/e) 
calls to F are necessary, which still left a gap between the lower and upper bounds for 
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this problem. In this paper, we close this gap by presenting an asymptotically optimal 
algorithm to approximate the mean. We also describe a second algorithm that is a varia- 
tion of Aharonov's algorithm (T), which may be more suitable than the first one in some 
contexts. 

Afterwards, these two algorithms to approximate the mean are used in combination 
with the quantum algorithm for finding the minimum of Diirr and H0yer [ 6 ] to obtain a 
quantum algorithm for approximating the median among a set of points with arbitrary 
black-box distance function between these points. The median, which is defined as the 
point with minimum average (or total) distance to the other points, can be thought of 
as the point that is the most representative of all the other points. Note that this is very 
different from the simpler problem of finding the median of a set of values, which has 
already been solved by Nayak and Wu ifTOTl . Our median-finding algorithm combines 
the amplitude estimation technique of Brassard, H0yer, Mosca and Tapp Q with the 
minimum-finding algorithm of Diirr and H0yer Q. 

The outline of the paper is as follows. In Section [2j we present all the tools that 
we need, including Grover's algorithm, the quantum algorithm for computing the mini- 
mum of a function and the amplitude estimation technique. In Section [3l we describe 
two efficient algorithms to approximate the mean value of a function, which we use in 
Section @] to develop our novel quantum algorithm for approximating the median of an 
ensemble of points for which distances between them are given by a black box. Finally, 
we conclude in Section [5] with open questions for future work. 

2 Preliminaries 

In this section, we briefly review the quantum information processing notions that are 
relevant for understanding our algorithms. A detailed account of the field can be found 
in the book of Nielsen and Chuang [11]. 

As is often the case in the analysis of quantum algorithms, we shall assume that 
the input to the algorithms is given in the form of a black box (or "oracle") that can be 
accessed in quantum superposition. In practice, the quantum black box will be imple- 
mented as a quantum circuit that can have classical inputs and outputs. We shall count 
as our main resource the number of calls (also called "evaluations") that are required to 
that black box. 
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Theorem 2.1 (Search [[U 13). There exists a quantum algorithm that takes an arbi- 
trary function F : {0, . . . , N — 1} —¥ {0, 1} as input and finds some x such that 
F(x) = 1 if one exists or outputs "void" otherwise. Any such x is called a "solution". 
The algorithm requires 0(y/~N ) evaluations of F if there are no solutions. If there are 
s > solutions, the algorithm finds one with probability at least 2 /s after 0(^N/s ) 
expected evaluations of F. This is true even if the value of s is not known ahead of time. 

Following Graver's seminal work, Diirr and H0yer [6] have proposed a quantum 
algorithm that can find the minimum of a function with a quadratic speed-up compared 
to the best possible classical algorithm. 

Theorem 2.2 (Minimum Finding (6) |5l). There exists a quantum algorithm minimum 
that takes an arbitrary function F : {0, . . . ,N — 1} —> Y as input (for an arbitrary 
totally ordered range Y) and returns a pair (i,F(i)) such that F(i) is the minimum 
value taken by F. The algorithm finds a correct answer with probability at least 3 / 4 
after 0(y/~N ) evaluations of F. 

Another extension of Graver's algorithm makes it possible to approximately count 
the number of solutions to a search problem [4J. It was subsequently formulated as 
follows. 

Theorem 2.3 (Counting Q). There exists a quantum algorithm count that takes an 
arbitrary function F : {0, . . . , N — 1} — > {0, 1} as input as well as some positive 
integer t. If there are s values of x such that F(x) = 1, algorithm COUnt(F, t) outputs 
an integer estimate sfor s such that 

Js(N - s) 7T 2 N 

\s-s\< 2tt^^ + — T - 

t t z 

with probability at least 8/n 2 after exactly t evaluations of F. In special case s = 0, 
COUnt(F, t) always outputs perfect estimate s = 0. 

The following theorem on amplitude amplification is also adapted from 0. Its state- 
ment is rather more technical than that of the previous theorems. 

Theorem 2.4 (Amplitude estimation [3]). There exists a quantum algorithm ampli- 
tude-estimation that takes as inputs two unitary transformations A and B as well as 
some positive integer t. If 

A|0) = a|V>o) + /3|V>i) 
(where and \ip\) are orthogonal states and |0) is of arbitrary dimension) and 

£foto>|0) = |V>o>|0> and S^i)|0) = |Vi> |1> , 
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then amplitude-estimation(A, B, t) outputs d, an estimate of a = ||/3|| 2 , such that 



\/a(I — a) 7r 2 
t t z 

with probability at least 8/ir 2 at a cost of doing t evaluations each of A, A^ 1 and B. 

We shall also need the following technical result, which we derive using standard 
Chernoff bound arguments. 

Theorem 2.5 (Majority). Let B be a quantum black box that approximates some func- 
tion 

F : {0, . . . , iV — 1} — >■ {0, . . . , M — 1} 
such that its output is within A of the true value with probability at least 2 /^, i.e. 

B V) 1°) = ^Z a ij N) \ x ij) and XT \ a ij?^ 2 /-i 

3 {j:\x t] ~F(i)\^} 

for all i. Then, for all n there exists a quantum black box B n that computes F with its 
output within 2 A of the true value with probability at least 1 — 1/n, i.e. 

B n \i) |0) = /jfiij \i) \Vij) and |/3jj| 2 ^ 1 - 1/n 

for all i. Algorithm B n requires 0(log n) calls to B. 

Proof. Given an input index i, B n calls k times black box B with input i, where 
k = [(lgn)/Z)(||||)] and denotes the standard Kullback-Leibler divergence (9l 

(sometimes called the relative entropy). If there exists an interval of size 2A that con- 
tains at least 3 /s of the outputs, then B n outputs the midpoint of that interval. If there 
is no such interval (a very unlikely occurrence), then B n outputs 0. If at least 3 /s of 
the outputs are within A of F(i), then the output of B n cannot be further than 2 A from 
F(i) since the interval selected by B n must contain at least one of those points. By the 

Chernoff bound, this happens with probability at least 1 — 2 V5||3y^x — 1/n. □ 

Hereinafter, we shall denote by majority(£>, n) the black box B n that results from 
using this algorithm on black box B with parameter n. Note that D (| |||) > 1/100, 
hence majority(i?, n) requires less than 100 lg n calls to B. Note also that the number 
of calls to B does not depend on A. 
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3 Two Efficient Algorithms to Approximate the Mean 



We present two different algorithms to compute the mean value of a function. In both 
algorithms, let F : {0, . . . , N — 1} — > [0, 1] be a black-box function and let 
m = jj Ylx F{ x ) be the mean value of F, which we seek to approximate. Without 
loss of generality, we assume throughout that N is a power of 2. The first algorithm 
assumes that F(x) can be obtained with arbitrary precision at unit cost while the second 
algorithm considers that the output of function F is given with I bits of precision. 

Algorithm 1 meanl (F, N, t) 

Let ^ 

A' \x) |0> = \x) {y/l-F{x)\Q) + y/Fjx)\l) ) 

and ,4 = A'(H® lgAf ®ld), 

where H is the Walsh-Hadamard transform and Id denotes the identity 

transformation on one qubit 

Let 

£|*)|0>|0> = |*>|0>|0> 

and 

B\x) |1) |0) = \x) |1) |1) 

return amplitude_estimation(^4, B, t) 

Note that in AlgorithmQ] it is easy to implement A' (and therefore A as well as A^ 1 ) 
with only two evaluations of F. First, F is computed in a ancillary register initialized 
to |0), then the appropriate controlled rotations are performed, and finally F is computed 
again to reset the ancillary register back to 1 0) . (In practice, this transformation will be 
approximated to a prescribed precision.) The following theorem formalizes the result 
obtained by this algorithm. 

Theorem 3.1. Given a black-box function F : {0, . . . , N — 1} —> [0, 1] and its mean 
value m = jj F(x), algorithm meanl outputs rh such that \rh — m\ GO (-r) with 
probability at least 8 /it 2 . The algorithm requires At evaluations of F. 



Proof. Using the same definition as in Theorem 12.41 we have that 



, V JV 

The algorithm amplitude_estimation( J 4, B, t) returns an estimate rh = a of 
jj J2 X F(x) = m and thus rh is directly an estimate of m. The error 
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rh — m is at most 



\J m{l — m) tt 2 




) 



(1) 



t t 2 



with probability at least 8/tt 2 . This requires At evaluations of F because each of the t 



This theorem states that the error goes down asymptotically linearly with the num- 
ber of evaluation of F. This is optimal according to Nayak and Wu [10], who have 
proven that in the general case in which we have no a priori knowledge of the possi- 
ble distribution of outputs, an additive error of e requires an amount of work propor- 
tional to \/e in the worst case when the function is given as a black box. Note that for 
t < (1 + y/2)n / y/m, when our bound on the error exceeds the targeted mean (which is 
rather bad), the error goes down quadratically (which is good). 

We now present a variation on an algorithm of Aharonov [1] and analyse its charac- 
teristics. This algorithm is also based on amplitude estimation, but it relies on the fact 
that points in real interval [0, 1] can be represented in binary as £-bit strings, where £ 
is the precision with which we wish to consider the output of black-box function F. 
The algorithm estimates the number of Is in each binary position. The difference 
between our algorithm (mean2) and Aharonov's original algorithm is that we make 
sure that the estimates of the counts in every bit position are all simultaneously within 
the desired error bound. For each i between 1 and I, let Fi(x) represent the i th bit 
of the binary expansion of F(x), so that F(x) = Y2i Fi{x)2~ l , with the obvious case 
Fi(x) = 1 for all i when F(x) = 1. 



Algorithm 2 mean2(F, N) 


for % = 1 to £ do 




rhi = majority (county (a;), 5ttVn) 


,n=\l£]) 


end for 




return m = ^ Ya=i 





Theorem 3.2. Given a black-box function F : {0, . . . , N — 1} — > [0, 1] where the 
output of F has £ bits of precision, algorithm mean2 outputs an estimate rh such 
that \rh — m\ ^ jf ^ y / m72~*, where rrii = ^2 X Fi(x), with probability at least 2 /^. 
The algorithm requires 0{yN ^log^) evaluations of F. 

Proof. The proof is a straightforward corollary of Theorems I2.3l and l2.5l Using count 
on each column with s = bir\/N yields an error of 



calls on A and on A 



requires 2 evaluations of F. 



□ 
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with probability at least 8/7T 2 , and hence with probability at least 2 /%, where rhi denotes 
COUnt(Fj(x), 57r\/iV ). Using majority with n = \\V\ on this, we obtain 

\m.i - rhi\ ^ §V"^+ m 

with probability at least 1 — Jl. When rtii ^ 1, this is bounded by ■ s /mi. Furthermore, 
count makes no error when m, = 0. Hence, the error in each column is bounded by 
■s/rnl with probability at least 1 — J|. By the union bound, all of our estimates for the 
columns are simultaneously within the above error bounds with probability at least 2 /s, 
and the error bound on our final estimate is \rh — m\ ^ i Yl \/^72 _ \ It is straightfor- 
ward to count the number of evaluations of F from Theorems I2.3l andl 2.5l □ 

The choice of which among algorithms meanl or mean2 is more appropriate 
depends on the particular characteristics of the input function. For example, one can 
consider the situation in which F{x) = 2~ N for all x, hence the mean m = 2~ N as 
well. In this case, if we choose t = cN 3 / 2 lgN in meanl and £ = N in mean2, 
where constant c is chosen so that both algorithms call function F the same number 
of times, the first algorithm is in the regime t <C (1 + \/2)ir/y/rn where it performs 
badly because the error on the estimated mean is expected to be much larger than the 
mean itself. On the other hand, the expected error produced by the second algorithm 
is bounded by m/ y/N, which is much smaller than the targeted mean. At the other 
end of the spectrum, if F{x) = 72 f° r an x > hence the mean m = 72 as well, and if 
t 3> 2tt^/N, then the error produced by meanl is much smaller than m/yN accord- 
ing to Equation (Q}. With the same parameters, the error produced by mean2, which is 
again bounded by m/ \/~N, is strictly unaffected by the choice of £, so that the second 
algorithm can work arbitrarily harder than the first, yet produce a less precise estimate 
of the mean. 



4 Approximate Median Algorithm 

Let dist: {0, . . . , N — 1} X {0, . . . , N — 1} ->• [0, 1] be an arbitrary black-box distance 
function. 

Definition 4.1 (Median). The median is the point within an ensemble of points whose 
average distance to the other points is minimum. 

Formally, the median of a set of points Q = {0, . . . , N — 1} is 

N-l 

median(Q) = argmin dist(z, j). 

zeQ j=Q 
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The median can be found classically by going through each point z G Q, computing 
the average distance from z to all the other points in Q, and then taking the minimum 
(ties are broken arbitrarily). This process requires a time of 0{N 2 ). In the general case, 
in which there are no restrictions on the distance function used and no structure among 
the ensemble of points that can be exploited, no technique can be more efficient than 
this naive algorithm. Indeed, consider the case in which all the points are at the same 
distance from each other, except for two points that are closer than the rest of the points. 
These two points are the medians of this ensemble. In this case, classically we would 
need to query the oracle for the distances between each and every pair of points before 
we can identify one of the two medians. (We expect to discover this special pair after 
querying about half the pairs on the average but we cannot know that there isn't some 
other even closer pair until all the pairs have been queried.) This results in a lower 
bound of Q(iV 2 ) calls to the oracle. 

In Algorithm 3, mean stands for either one of the two algorithms given in the pre- 
vious section (in case meanl is used, parameter t must be added) but it is repeated 
0(log N) times in order to get all the means within the desired error bound with a con- 
stant probability via our majority algorithm (Theorem I2. 5 1 ). Here, dj = J2j dist(i, j) 
and d m - m = for any k such that dk ^ d{ for all i. 

Algorithm 3 median(dist) 

For each i, define function Fi(x) = dist(«, x) 

For each i, define Jj = majority (mean(Fj, N), n = N 2 ) 

return minimum^,) 

Theorem 4.2. For any black-box distance function 

dist : {0, . . . , JV - 1} x {0, . . . , JV - 1} -»• [0, 1] , 

when meanl is used with parameter t, algorithm median outputs an index j such 
that \dj — d m i n \ G 0(l/t) with probability at least 2 j^. The algorithm requires 
0(ty/N log N) evaluations ofdist. 

Proof. This result is obtained by a straightforward combination of Theorems 12.21 12.51 
and 13. II The procedure majority is used with parameter n = N 2 to ensure that all the 
di's computed by the algorithm (in superposition) are simultaneously within the bound 
given by Theorem 13. II except with probability o(l). Note that with parameter n = N 2 , 
the number of repetitions is still in 0(log N). The success probability of the algorithm 
follows from the fact that 1(1 — o(l)) > 2 jz- In this case, the error is in 0{l/t) and the 
number of evaluations of dist is in 0(t^/N log N). □ 
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By replacing meanl by mean2 in the median algorithm we obtain the following 
theorem. 

Theorem 4.3. For any black-box distance function 



with probability at least (See algorithm mean2 for a definition of mi and I.) 
The algorithm requires O(NlogN) evaluations of dist 

5 Conclusion 

We have described two quantum algorithms to approximate the mean and their appli- 
cations to approximate the median of a set of points over an arbitrary distance function 
given by a black box. We leave open for future work an in-depth study on how the 
different behaviour of the two algorithms impact the quality of the median they return. 
For instance, we know that the behaviour of both algorithms for the mean depends on 
the distribution of data points and the distances between points, but we still have to 
investigate more precisely the exact context where it matters. Of course, understanding 
the behaviour of the algorithms in different contexts is important, but a more interesting 
question is to tailor the algorithm to obtain better results on different data distributions 
of interest. 
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